In mathematics, the Bauer–Fike theorem is a standard result in the perturbation theory of the eigenvalue of a complex-valued diagonalizable matrix. In its substance, it states an absolute upper bound for the deviation of one perturbed matrix eigenvalue from a properly chosen eigenvalue of the exact matrix. Informally speaking, what it says is that the sensitivity of the eigenvalues is estimated by the condition number of the matrix of eigenvectors.
Contents |
Let be a diagonalizable matrix, and be the non singular eigenvector matrix such that . Be moreover an eigenvalue of the matrix ; then an eigenvalue exists such that:
where is the usual condition number in p-norm.
If , we can choose and the thesis is trivially verified (since ).
So, be . Then . being an eigenvalue of , we have and so
and, since as stated above, we must have
which reveals the value −1 to be an eigenvalue of the matrix .
For each consistent matrix norm, we have , so, all p-norms being consistent, we can write:
But being a diagonal matrix, the p-norm is easily computed, and yields:
whence:
The theorem can also be reformulated to better suit numerical methods. In fact, dealing with real eigensystem problems, one often has an exact matrix , but knows only an approximate eigenvalue-eigenvector couple, (,), and needs to bound the error. The following version comes in help.
Let be a diagonalizable matrix, and be the non singular eigenvector matrix such as . Be moreover (,) an approximate eigenvalue-eigenvector couple, and ; then an eigenvalue exists such that:
where is the usual condition number in p-norm.
We solve this problem with Tarık's method: m (otherwise, we can choose and theorem is proven, since ). Then exists, so we can write:
since is diagonalizable; taking the p-norm of both sides, we obtain:
But, since is a diagonal matrix, the p-norm is easily computed, and yields:
whence:
The Bauer–Fike theorem, in both versions, yields an absolute bound. The following corollary, which, besides all the hypothesis of Bauer–Fike theorem, requires also the non-singularity of A, turns out to be useful whenever a relative bound is needed.
Be a non-singular, diagonalizable matrix, and be the non singular eigenvector matrix such as . Be moreover an eigenvalue of the matrix ; then an eigenvalue exists such that:
(Note: can be formally viewed as the "relative variation of A", just as is the relative variation of λ.)
Since μ is an eigenvalue of (A+δA) and , we have, left-multiplying by :
that is, putting and :
which means thatis an eigenvalue of, with eigenvector. Now, the eigenvalues of are , while its eigenvector matrix is the same as A. Applying the Bauer–Fike theorem to the matrix and to its eigenvalue, we obtain:
If A is normal, V is a unitary matrix, and , so that .
The Bauer–Fike theorem then becomes:
which obviously remains true if A is a Hermitian matrix. In this case, however, a much stronger result holds, known as the Weyl theorem.